iFish: predicting the pathogenicity of human nonsynonymous variants using gene-specific/family-specific attributes and classifiers
نویسندگان
چکیده
Accurate prediction of the pathogenicity of genomic variants, especially nonsynonymous single nucleotide variants (nsSNVs), is essential in biomedical research and clinical genetics. Most current prediction methods build a generic classifier for all genes. However, different genes and gene families have different features. We investigated whether gene-specific and family-specific customized classifiers could improve prediction accuracy. Customized gene-specific and family-specific attributes were selected with AIC, BIC, and LASSO, and Support Vector Machine classifiers were generated for 254 genes and 152 gene families, covering a total of 5,985 genes. Our results showed that the customized attributes reflected key features of the genes and gene families, and the customized classifiers achieved higher prediction accuracy than the generic classifier. The customized classifiers and the generic classifier for other genes and families were integrated into a new tool named iFish (integrated Functional inference of SNVs in human, http://ifish.cbi.pku.edu.cn). iFish outperformed other methods on benchmark datasets as well as on prioritization of candidate causal variants from whole exome sequencing. iFish provides a user-friendly web-based interface and supports other functionalities such as integration of genetic evidence. iFish would facilitate high-throughput evaluation and prioritization of nsSNVs in human genetics research.
منابع مشابه
Down-regulation of HSP40 gene family following OCT4B1 suppression in human tumor cell lines
Objective(s): The OCT4B1, as one of OCT4 variants, is expressed in cancer cell lines and tissues more than other variants and plays an important role in apoptosis and stress (heat shock protein) pathways. The present study was designed to determine the effects of OCT4B1 silencing on expressional profile of HSP40 gene family expression in three different human tumor cell lines. Materials and Met...
متن کاملMAPPIN: a method for annotating, predicting pathogenicity and mode of inheritance for nonsynonymous variants
Nonsynonymous single nucleotide variants (nsSNVs) constitute about 50% of known disease-causing mutations and understanding their functional impact is an area of active research. Existing algorithms predict pathogenicity of nsSNVs; however, they are unable to differentiate heterozygous, dominant disease-causing variants from heterozygous carrier variants that lead to disease only in the homozyg...
متن کاملKCNE1 and KCNE2 variants in Patients with Long QT Syndrome
Introduction: Long QT syndrome (LQTS) is a type of ventricular arrhythmia characterized by prolonged QT intervals on electrocardiogram or delay in ventricular repolarization and it can lead to syncope, seizure and sudden cardiac death. Here, KCNE1 and KCNE2 variants are studied among Iranian affected families with this syndrome. Materials and Methods: Fifty patients referring to Rajaei Cardiov...
متن کاملIdentification of a Novel Splice Site Mutation in RUNX2 Gene in a Family with Rare Autosomal Dominant Cleidocranial Dysplasia
Introduction: Pathogenic variants of RUNX2, a gene that encodes an osteoblast-specific transcription factor, have been shown as the cause of CCD, which is a rare hereditary skeletal and dental disorder with dominant mode of inheritance and a broad range of clinical variability. Due to the relative lack of clinical complications resulting in CCD, the medical diagnosis of this disorder is challen...
متن کاملMissense variant pathogenicity predictors generalize well across a range of function-specific prediction challenges.
The steady advances in machine learning and accumulation of biomedical data have contributed to the development of numerous computational models that assess the impact of missense variants. Different methods, however, operationalize impact differently. Two common tasks in this context are the prediction of the pathogenicity of variants and the prediction of their effects on a protein's function...
متن کامل